A Large-Grain Parallel Sparse System Solver

Authors

  • Kyle A. Gallivan
  • Bret A. Marsolf
  • Harry A. G. Wijshoff
Abstract

The efficiency of solving sparse linear systems on parallel processors and more complex multicluster architectures such as Cedar is greatly enhanced if relatively large-grain computational tasks can be assigned to each cluster or processor. The ordering of a system into a bordered block upper triangular form facilitates a reasonable large-grain partitioning. A new algorithm that produces this form for unsymmetric sparse linear systems is considered, and the associated factorization algorithm is presented. Computational results are presented for the Cedar multiprocessor.

Several techniques have been proposed to solve large sparse systems of linear equations on parallel processors. A key task that determines the effectiveness of these techniques is the identification and exploitation of the computational granularity appropriate for the target multiprocessor architecture. Many algorithms assume special properties such as symmetric positive definiteness or exploit knowledge of the application from which the system arises, e.g., finite element problems. In this paper, we give an overview of the use of a new ordering technique, the hybrid ordering (H*), and an associated factorization algorithm for unsymmetric unstructured sparse linear systems. More detail on the reordering can be found in [7] and on the merger of the reordering and the factorization algorithm for multicluster architectures in [4].

1. The Hybrid Ordering. The hybrid ordering H* is composed of two different types of orderings: unsymmetric and symmetric. The unsymmetric ordering changes the associated graph of the matrix, mostly by row or column interchanges. The symmetric orderings only relabel the nodes of the associated graph and maintain certain properties of the system, e.g., symmetry and diagonal dominance. The symmetric orderings are used to obtain a bordered block triangular matrix. The unsymmetric ordering is used to enhance the numerical properties of the matrix.
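The distinction between the two kinds of ordering can be illustrated with a minimal Python sketch. This is not the H* algorithm itself; the function names, the small example matrices, and the use of a zero-free diagonal as a stand-in for "enhanced numerical properties" are all assumptions made for illustration.

```python
def permute_symmetric(a, perm):
    """Apply the same permutation to rows and columns (P A P^T).

    This only relabels the nodes of the associated graph.
    """
    return [[a[i][j] for j in perm] for i in perm]


def permute_rows(a, perm):
    """Apply a permutation to the rows only (P A), an unsymmetric ordering."""
    return [list(a[i]) for i in perm]


def is_symmetric(a):
    """Check whether a square matrix equals its transpose."""
    n = len(a)
    return all(a[i][j] == a[j][i] for i in range(n) for j in range(n))


def has_zero_free_diagonal(a):
    """Check that no diagonal entry is zero."""
    return all(a[i][i] != 0 for i in range(len(a)))


# Hypothetical symmetric example matrix.
A = [[4, 1, 0],
     [1, 5, 2],
     [0, 2, 6]]

# Symmetric ordering: symmetry and the set of diagonal entries are kept.
A_sym = permute_symmetric(A, [2, 0, 1])

# Row-only ordering of the same matrix: symmetry is lost.
A_rows = permute_rows(A, [2, 0, 1])

# Hypothetical matrix with a zero diagonal; a row interchange repairs it
# (an assumed example of improving numerical properties).
B = [[0, 3],
     [2, 0]]
B_rows = permute_rows(B, [1, 0])
```

In this sketch, `A_sym` remains symmetric with the same diagonal entries in a different order, while `A_rows` is no longer symmetric. The row interchange applied to `B` moves its nonzeros onto the diagonal, which is one simple way an unsymmetric ordering can make a matrix better suited to factorization.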

Similar resources

Performance of a Fully Parallel Sparse Solver

The performance of a fully parallel direct solver for large sparse symmetric positive definite systems of linear equations is demonstrated. The solver is designed for distributed-memory, message-passing parallel computer systems. All phases of the computation, including symbolic processing as well as numeric factorization and triangular solution, are performed in parallel. A parallel Cartesian n...

pARMS: A Package for the Parallel Iterative Solution of General Large Sparse Linear Systems: User's Guide

For many large-scale applications, solving large sparse linear systems is the most time-consuming part. The important criteria for a suitable solver include efficiency, robustness, and good parallel performance. The Parallel Algebraic Recursive Multilevel Solver (pARMS) [8] is a suite of distributed-memory iterative accelerators and preconditioners targeting the solution of general sparse linea...

A Parallel Frontal Solver For Large Scale Process Simulation and Optimization

For the simulation and optimization of large-scale chemical processes, the overall computing time is often dominated by the time needed to solve a large sparse system of linear equations. We present here a new parallel frontal solver which can significantly reduce the wallclock time required to solve these linear equation systems using parallel/vector supercomputers. The algorithm exploits both ...

A Parallel Frontal Solver For Large Scale Process Simulation

For the simulation and optimization of large-scale chemical processes, the overall computing time is often dominated by the time needed to solve a large sparse system of linear equations. We present here a new parallel frontal solver which can significantly reduce the wallclock time required to solve these linear equation systems using parallel/vector supercomputers. The algorithm exploits both ...


Publication date: 1989